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Abstract. Pancake Flipping is the problem of sorting a stack of pancakes of different 
sizes (that is, a permutation), when the only allowed operation is to insert a spatula 
anywhere in the stack and to flip the pancakes above it (that is, to perform a prefix 
reversal). In the burnt variant, one side of each pancake is marked as burnt, and it 
is required to finish with all pancakes having the burnt side down. Computing the 
optimal scenario for any stack of pancakes and determining the worst-case stack for 
any stack size have been challenges over more than three decades. Beyond being an 
intriguing combinatorial problem in itself, it also yields applications, e.g. in parallel 
computing and computational biology. 

In this paper, we show that the Pancake Flipping problem, in its original (unburnt) 
variant, is NP-hard, thus answering the long-standing question of its computational 
complexity. 
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1 Introduction 



The pancake problem was stated in [7] as follows: 

The chef in our place is sloppy, and when he prepares a stack of pancakes they come out all different 

sizes. Therefore, when I deliver them to a customer, on the way to the table I rearrange them (so 
that the smallest winds up on top, and so on, down to the largest at the bottom) by grabbing several 
from the top and flipping them over, repeating this (varying the number I flip) as many times as 
necessary. If there are n pancakes, what is the maximum number of flips (as a function of n) that 
I will ever have to use to rearrange them? 

Stacks of pancakes are represented by permutations, and a flip consists in reversing a prefix of any length. 
The previous puzzle yields two entangled problems: 

• Designing an algorithm that sorts any permutation with a minimum number of flips (this optimization 
problem is called MIN-SBPR, for Sorting By Preflx Reversals). 

• Computing /(n), the maximum number of flips required to sort a permutation of size n (the diameter 

of the so-called pancake nef,work). 

Gates and Papadimitriou [9] introduced the burnt variant of the problem: the pancakes are two-sided, 
and an additional constraint requires the pancakes to end with the unburnt side up. The diameter of the 
corresponding burnt pancake network is denoted g{n). A number of studies [4, 5, 6, 9, 11, 12, 13] have aimed 
at determining more precisely the values of /(n) and g{n), with the following results: 

• /(n) and g{n) are known exactly for n < 19 and n < 17, respectively [5]. 

• 15n/14 < f{n) < 18n/ll + (9(1) [12, 4]. 

• [(3n + 3)/2j < g{n) < 2n - 6 [5] (upper bound for n > 16). 

Considering MIN-SBPR, 2-approximation algorithms have been designed, both for the burnt [6, 8] and 
unburnt [8] variants. Moreover, Labarrc^ and Cibulka [13] have characterized a subclass of permutations, 
called simple permutations, that can be sorted in polynomial time. 

The pancake problems have various applications. For instance, the pancake network, having both a small 
degree and diameter, is of interest in parallel computing. The algorithmic aspect, i.e. the sorting problem, 
has applications in comparative genomics, since prefix reversals are possible elementary modifications that 
can affect a genome during evolution. A related problem is Sorting By Reversals [1] where any subsequence 
can be flipped at any step, not only prefixes. This problem is now well-known, with a polynomial-time exact 
algorithm [10] for the signed case, and a 1.375-approximation [2] for the APX-hard unsigned case [3]. 

In this paper, we prove that the MIN-SBPR problem is NP-hard (in its unburnt variant), thus answering 
an open question raised several decades ago. We in fact prove a stronger result: it is known that the number 
of breakpoints of a permutation (that is, the number of pairs of consecutive elements that are not consecutive 
in the identity permutation) is a lower bound on the number of flips necessary to sort a permutation. We 
show that deciding whether this bound is tight is already NP-hard. 

2 Notations 

We denote by [[a ; bj the interval {a, a + 1, . . . , &} (for b < a,, we have [[a ; 6] =0). Let n be an integer. Input 
sequences are permutations of [[1 ; n], hence we consider only sequences where all elements are unsigned, and 
there cannot be duplicates. When there is no ambiguity, we use the same notation for a sequence and the 
set of elements it contains. We use upper case for sets and sequences, and lower case for elements. 

Consider a sequence S of length n, S = (:ri, X2, . . . , a;„). Element xi is said to be the head element 
of S. Sequence S has a breakpoint at position r, 1 < r < n ii Xr ^ Xr+i — 1 and Xr ^ Xr+i + 1. It has 
a breakpoint at position n if a;„ ^ n. We write db{S) the number of breakpoints of S. Note that having 
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Figure 1: Examples of efBcient flips. Sequence (5, 2, 3, 1, 4) is eSiciently sortable (in four flips), but 
(5, 2, 3, 4, 1) is not. 

Xi 1 does not directly count as a breakpoint, and that di,{S) < n for any sequence of length n. For any 
p < q £N,we write the sequence (p, p+ 1, p + 2, . . . , g). is the identity. For a sequence of any length 
S = (xi, X2, ■ ■ ■ , Xk), we write *S the sequence obtained by reversing 5: *S = (xk, Xk-i, ■ ■ ■ , xi). Given 

an integer p. we write p + S = {p + Xi, p + X2, . . . , p + Xk). 

The flip of length r is the operation that consists in reversing the r first elements of the sequence. It 
transforms 

S — ^^1, X2i . • . , ^rj * • * ? •^n^ 

into 

S = (^Xj-^ ^Ty—i, . . . , Xi, . . . , ^n)* 

Note that the flip of length 1 does not modify S, and the flip of length n transforms S into *S. 

Property 1. Given a sequence S' obtained from a sequence S by performing one flip, we have db{S')—db{S) G 
{-1,0,1}. 

A flip from S to S' is said to be efficient if dt{S') = di,{S) — 1, and wc reserve the notation S ^ S' for 
such flips. A sequence of size n, different from the identity, is a deadlock if it yields no efficient flip, and we 
write 5 — )• _L. By convention, we underline in a sequence the positions corresponding to possible efficient 
flips: there are at most two of them, and at least one if the sequence is neither a deadlock nor the identity. 

We call path a series of flips. A path is efficient if each flip is efficient in the series. A sequence S is 
efficiently sortable if there exists an efficient path from S to the identity permutation (equivalently, if it can 
be sorted in db{S) flips). Sec for example Figure 1. 

Let S be a sequence different from the identity, and T be a set of sequences. We write S T if both 
following conditions are satisfled: 

1. for each T e T, there exists an efficient path from S to T. 

2. for each efficient path from S to the identity, there exists a sequence T e T such that the path goes 
through T. 

If T consists of a single clement (T = {T}), we may write S =^ T instead of S {T}. Note that 
condition 1. is trivial if T = 0, and condition 2. is trivial if there is no efficient path from S to I^. Note 
that given a sequence S, there can be several different sets T such that S T. However, two are especially 
relevant: 

Property 2. Given any sequence S ^ X\, 

S =J> X\ <^ S is efficiently sortable. 
S => <^ 5 js not efficiently sortable. 



3 



Proof. For S =^ X^i'. condition 1. is true iff there exists an efficient path from S to the identity, that is S is 
efficiently sortable. Condition 2. is always true. 

For S => 0: condition 1. is always true. If there exists at least one efficient path from S to I^, then, 
since there exists no sequence T G 0, Condition 2. cannot be true. Hence Condition 2. is false when there 
exists an efficient path from S to the identity and true otherwise, so it is equivalent to the fact that S is not 
efficiently sortable. □ 

The following property is easily deduced from the definition. 

Property 3. If S ^ {81,82}, 81 =^ Ti and 82 T2, then 5 ^ Ti U T2. 



3 Reduction from 3-SAT 

The reduction uses a number of gadget sequences in order to simulate boolean variables and clauses with 
subsequences. They are organized in two levels (where level-1 gadgets are directly defined by sequences of 
integers, and levcl-2 gadgets arc defined using a pattern of lcvcl-1 gadgets). For each gadget we define, we 
derive a property characterizing the efficient paths that can be followed if some part of the gadget appears 
at the head of a sequence. 

We have not aimed at providing the smallest possible gadgets (the overall reduction for a formula con- 
taining / variables and k clauses creates a stack of 311 + 98k elements with 161 + 50k breakpoints), and we 
preferred straightforward proofs and easy-to-combine gadgets over short sequences. A rough analysis shows 
that the final stack size could easily be reduced to 221 + 71k, with the same number of breakpoints. 

3.1 Level-1 gadgets 
3.1.1 Docks 

The dock gadget is the simplest we define. Its only goal is to store sequences of the kind (with p < q) 

out of the head of the sequence, without "disturbing" any other part. 

Definition 1. Given two integers p and q with p < q, the dock for is the sequence 

Dock{p, q) = D 

where D = (p — 1, g + 1, g + 2). 

It has the following property: 

Property 4. Let p and q be any integers with p < q, D = Dock{p, q), and X and Y be any sequences. We 
have 

Y ^ 

A p-i 

D 9+2 

Y ^ 

Proof. An efficient path from (*XP'^^, X, D, to (X, 2^+2 > given in Figure 2. For each sequence in 

the path, we apply the only possible efficient fiip, hence every efficient path between X, D, F) and 

(if such a path exists) begins with these two flips, and goes through (X, Ig+2 ' ^ 



4 



*-Tp+l 

X 
D 
Y 



q-l 



P 



X 



p + 2 
p+1 

X 
p-1 

JL 

q+1 q+1 
q+2 q+2 



p — 1 p — i 
*x p 

p+i p+i 

p+2 p+2 



Y 



q-l q-l 

q q 

q + 2 
Y 



X 
Y 



Y 



Figure 2: Proof of Property 4. (Dock gadget) 



3.1.2 Lock 

A lock gadget contains three parts: a sequence which is the lock itself, a key element that "opens" the lock, 
and a test element that checks whether the lock is open. 

Definition 2. For any integer p, Lock{p) is defined by 

Lock{p) = {key, test, L) 

where key = p + 10 

test = p + 7 

L = 2, 9, 8, 5, 6, 4, 3, 11, 12) 



Given a lock {key, test, L) = Lock{p), we write 

L''=p+ (1, 2, 3, 4, 6, 5, 8, 9, 10, 11, 12>. 

Sequences L and L° represent the lock when it is respectively closed or open. If a sequence containing a 
closed lock has key for head element, then efficient flips put the lock in open position. If it has test for head 
element, then it is a deadlock if and only if the lock is closed. 

Property 5. Let p be any integer, {key, test,L) = Lock{p), and X and Y be any sequences. We have 

key ^ test test 

a ^ ^ h ^ => 7^+^ c ^ ^ \ 

a. ^ b. ^ c. ^ ^ ^ 

Y Y ^ Y 

Proof. See Figure 3. Note that for readability reasons, the proof is given for p = 0. It can obviously be 
extended to any value of p (each element would then be increased hy p). □ 

We use locks to emulate literals of a boolean formula: variables "hold the keys" , and in a first time open 

the locks corresponding to true literals. Each clause holds three test elements, corresponding to its three 
literals, and the clause is true if the lock is open for at least one of the test elements. 



3.1.3 Hook 

A hook gadget contains four parts: two sequences used as delimiters, a take element that takes the interval 
between the delimiters and places it in head, and a put element that does the reverse operation. Thus, the 
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Figure 3: Proof of Property 5. (Lock gadget) 
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sequence between the delimiters can be stored anywhere until it is called by take, and then can be stored 

back using put. 

Definition 3. For any integer p, Hook{p) is defined by 

Hook{p) — {take, put, G , H) 

where take = p+lQ 

put = p + 7 

G = p+<3,4) 

H = p +(12, 11, 6, 5, 9, 8, 2, 1). 
Given a hook {take, put, G,H) = Hook{p), we write 

G' = p+ (12, 11, 6, 5, 4, 3) 

H' = p+(lO, 9, 8, 2, 1) 

G" = p + (3, 4, 5, 6, 7) 

H" = p + (12, 11, 10, 9, 8, 2, 1). 

Property 6. Let p be an integer, {take, put, G,H) = Hook{p), and X, Y and Z be any sequences. We have 
take ^ put ^ 

G' G" X 



Y — ^ ^ *Y — H" — p+12 



H % H' ^" Y ^ 

Z ^ Z ^ 

Proof. See Figure 4 (with p = Q). □ 



3.1.4 Fork 

A fork gadget implements choices. It contains two parts delimiting a sequence X. Any efficient path 
encountering a fork gadget follows one of two tracks, where either X or *X appears at the head of the 
sequence at some point. Sequence X would typically contain a series of triggers for various gadgets {key, 
take, etc.), so that X and *X differ in the order in which the gadgets are triggered. 

Definition 4. For any integer p, Fork{p) is defined by 

Fork{p) = {E, F) 
where E = p+(ll, 8, 7, 3) 

F = p+ (10, 9, 6, 12, 13, 4, 5, 15, 14, 2, 1>. 

Given a fork {E,F) = Fork{p), we write 

F^ = 39+ (10, 9, 6, 7, 8, 11, 12, 13, 14, 15, 5, 4, 3, 2, 1) 
^2 = p+ (3, 7, 8, 11, 10, 9, 6, 12, 13, 4, 5, 15, 14, 2, 1) 



Property 7. Let p be an integer, {E,F) = Fork{p), and X, Y be any sequences. We have 

F Yy Y Y ^ Y 
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Figure 4: Proof of Property 6. (Hook Gadget) 
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Proof. See Figures 5 and 6 (with p = Q). 



□ 



3.2 Level-2 gadgets 

In this section, wc define new gadgets based on the four levcl-1 gadgets. From now on, eacli property 
proof uses exclusively properties from smaller gadgets. In order to help the reader follow the ever-present 
references, we use the following notations. Bold font is used to emphasise the "active" parts of the gadget 
currently having an element at the head of the sequence. For each relation S T, wc give the relevant 
reference below (e.g. S =j=> T if it is obtained from Property 4). Finally, a summary of all gadget properties 
(either level-1 or -2) is given in Figure 7. 



3.2.1 Literals 

The following gadget is used only once in the reduction. It contains the locks corresponding to all literals of 

the formula. 

Definition 5. Let p and m he two integers, Literals{p,m) is defined by 

Literals{p,m) = (key^^, . . . ,keyj^,testi, . . . ,testm,^) 
where A = (ii, L2, . . . , Lm) 
Vi e II ; m] , (keyi, testi, Li) = Lock{p + I2{i - 1)) 

Let O and I be two disjoint subsets of [1 ; m] . We write the sequence obtained from A by 

• replacing Li by L° for all i £ O, 

• replacing Li by 2'p+i2i~^^ foi^ Qi^^ i € L. 

Elements of O correspond to open locks in Af , while elements of / correspond to open locks which have 
moreover been tested. Note that Ag = A, and that A^.^j = 1^Xi2m- 

Property 8. Let p and m he two integers, (keyi , . . . , key„^, testi , . . . , testm, A) = Literals{p, m), O and I be 
two disjoint subsets of {l; rnj, and X be any sequence. We have 



keyi 

a. Vi e II ; m] - O - /, X 

Af 



X 



testi 

b. yiGO, X ==> .o-{i} 

Af ^u{i} 

testi 

c. Vi e II ; m] - O, X -s- _L 

Af 

Proof. The proof follows from Property 5. 

a. Let i e {1; mj - O - L. Then Af can be written Af = (A, Li, B). Hence 

{key,, X, Af) 
= (kevi, X, A, Li, B) 
If A LI B) 
= {X, A°^^^>) 
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Figure 5: Proof of Properties 7. a and 7.b (Fork gadget). 
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Figure 6: Proof of Property 7.c (Fork gadget). 
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Dock gadget 

{*X^+\X, D, Y)^{X,^-lY) 

Lock gadget 
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5.C 
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(F^ Y) ^ {*IIXI„ Y) 
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Vz^OU/, {key,.X., A?) =^ (X, A°^W) 
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Vi ^ O, (testi, X, A?) ± 

^ ' 8.C 
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(-'^'^^^'^?)if{lx,v>Y:f^ 
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Figure 7: Compilation of all gadget properties. As a general rule, X, Y, Z can be any sequences, O and I 
any disjoint subsets of [l ; m] . See respective definitions and properties for specific constraints and notations 
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b. Let i e O. Then Af can be written Af = {A, L?, B). Hence 

{testi, X, Af) 

= {tesU, X, A, LI, B) 

c. Let i £ \l;m\ — O. If i e /, then iesij € '^pXi2V ^ ^^"^ {t^sti, X, Af) is not a vaUd 
sequence (it contains a dupUcate). Otherwise, i e |[1 ; m] — O — /, and Aj^ can be written = (A, Li, B). 
Hence 

(iesii, X, Af) = {tesU, X, A, Li, B) ^ ± 

□ 

3.2.2 Variable 

In the following two sections, we assume that pA a-nd to are two fixed integers, and we define the L«ieraZ,s(,g)adget 
{keyi, . . . , key„^, testi, . . . , test„i, A) = Literals{px,m). Thus, we can use elements key,i and testi for i € 
[1 ; to], and sequences A^ for any disjoint subsets O and / of |1 ; to]. 

We now define a gadget simulating a boolean variable .t,;. It holds two scries of key elements: the ones 
with indices in P (resp. N) open the locks corresponding to literals of the form Xi (resp. ~^Xi). When the 
triggering element, v, is brought to the head, a choice has to be made between P and A'', and the locks 
associated with the chosen set (and only them) are open. 

Definition 6. Let P, N be two disjoint subsets of |1 ; to] (P = {pi,P2, ■ ■ ■ ^ = {ni,n2, . . . ,nq'}) and 
p be an integer, Variable{P, N, p) is defined by 

Variable{P, N,p) = {v, V, D) 
where {take, put, G, H) = Hook{p + 2) 
{E, F) = Forkip + 14) 
in V = take 

V = {G, E, kevp^ , kevp^ , put, key^^ , key^^, , F, H) 
D = Dock{p + 2,p + 2Q) 
Given a variable gadget {v,V,D) = Variable{P, N , p) , we write 

V' = {G",key„^,...,key^^,,F\H") 
= {G",key^^,...,key^^,F\H") 

where G", H" , F^, F"^, come from the definitions of Hook (Definition 3) and Fork (Definition 4)- 

The following property determines the possible behavior of a variable gadget. It is illustrated by Figure 8. 

Property 9. Let P, N be two disjoint subsets of |1 ; to], p be an integer, X and Y be two sequences, O, 
I be two disjoint subsets of\l ; to], and {i',V,D) = Variable{P, N ,p) . For sub-property (a.) we require that 
{P U N) r\ {O Ul) = 0, for (b.) that Nn{OUl) = 9, and for (c.) that P n (O U /) = (these conditions 
are in fact necessarily satisfied by construction since all sequences considered are permutations). We have 



X 
a. V 

Y 
Af 



V ...... V- ^ VI ^ 



X 


X 






Y 


Y 



N 



X p+i X 

b. D ^ c. D ^ 

Y ^ Y ^ 

Af A/ Aj 
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Figure 8: Initially, a variable gadget contains mainly the sequence V. Property 9a proves that two paths 
are possible, leading to sequences containing either or V^. Along the first (resp. second) path, the locks 
with indices in P (resp. N) are opened. By Property 9b (resp. c), there exists a path transforming (resp. 
y^) into the identity over |[p + 1 ; p + 31], which opens the remaining locks. 



Proof. 

a. {ly, X, V, Y, Af) 

= (take, X, G, E, key^^, key^^, put, key^^, key^^,, F, H, Y, Af) 
^ (E, keyp^, key^^, put, key„^, key^^,, F, G' , *X, H' , Y, Af) 
{81,82} (where sequences and ^2 are described below) 

First, 

= (keyp^, keyp^, key^^, put, key^^, key^^^^, F\ G', *X, H' , Y, Af) 
^ {key^^, key^^, put, key^^, key^^„ F\ G, *X, H' , Y, A°^^p^>) 

=, {put, key^^, ...,key^„F\ G' , *X, H' , Y, A^^^) 

8. a ^ Q 

{X, G", key^^, key F\ H", Y, A?^^) 

6.b ^ ^ 1 ' 

= {X, V\ Y, Af^P) 
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Second, 

52 = {key^^,, keyn^,_^, key^^, put, key^^, key^^, F'^,G', *X, H', Y, Af) 
^ {key^^,_^, key^^, put, key^^, key^^, F^,G', *X, H', Y, A^"^^"""'^) 

^ (put, keyj,^, keyj,^, F\ G' , *X, H' , Y, A^^^) 
^ {X, G", key^^, key^^, F\ H" , Y, A?^^) 
= {X, V^, Y, Apu^) 



b. {V\ X, D, Y, Af) 

= (G", key„^, key„^„ F\ H" , X, D, Y, Af) 

^ (key^^, key^^, key^^„ F\ ^^l, X, D, Y, Af) 

=^ {key^^,...,key,,^„ F\ X, D, Y, A?^{"^>) 

^ {F\ ^mi, X, D, Y, Kf^^) 
=^{X,X^^ll„Y,Af^^) 



c. (y^ X, D, Y, Af) 

= {G", keyp^, key^^, F\ H" , X, D, Y, Af) 

^ {key^^, key^^_^, key^^, F^ X, D, Y, Af) 

^ {key^^_^, key^^, F\ ^I^l, X, D, Y, A°^{^«>) 

^ (F^ ^X;Xl„ X, D, Y, Af^P) 
^{X,IlXl,Y,Af-^) 

□ 

3.2.3 Clause 

The following gadget simulates a 3-clause in a boolean formula. It holds the test elements for three locks, 
corresponding to three literals. When the triggering element, 7, is at the head of a sequence, three distinct 
efficient paths may be followed. In each such path, one of the three locks is tested: in other words, any 
efficient path leading to the identity requires one of the locks to be open. 
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Sq 5; 



teste 



testa 



testb 







S4 


S5 


testb 




testa 




testa 


testb 




teste 



X. 




Figure 9: Initially, a clause gadget contains mainly the sequence T. Property 10 proves that three paths may 
be possible, leading to sequences containing cither F^, or r'"*. Because of the test elements, each path 
requires one lock to be open (either o, 6 or c). By Property 11a (resp. b, c), there exists a path transforming 
(resp. r^, r^) into the identity over |p + 1 ; p + 62], provided the remaining locks are open. 



Definition 7. Let a,b,c G [1 ; m] be pairwise distinct integers andp be an integer, Clause{a, b, c,p) is defined 
by 

Clause{a,b,c,p) = (7, r,A) 

where (£'i,Fi) = Fork{p + 2) 

{E2,F2) = Fork{p + i5) 

{takei,puti,Gi,Hi) = Hook{p + 21) 

{tak&2,put2-i G2, -^2) = Hook{p + 33) 

Di = Dock{p + 2,p+ 17) 

D2 = Dock{p + 21,p + 60) 

OT 7 = takei 

r = (Gi, El, take2, put-^, teste, Fi, G2, E2, testa, put2, testb, F2, H2, Hi) 

A = (Di,D2) 

Given a clause gadget (7, P, A) = Clause{a,b,c,p), we write 

pi = {G'{, teste, Fl,G'l, testb, Fl,H'^,H'() 
P2 = {G'{, teste, Fl, G'i, testa. F^, H'^, H'l) 
P^ = {G'i, take2, F^, G2, E2, testa, put2, testb, F2, H2, H'l) 

The following two properties determine the possible behavior of a clause gadget. They are illustrated by 
Figure 9. 
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Property 10. Let X and Y be any sequences, and O, I be two disjoint subsets of |1 ; m]. We have 

7 
X 

r 

Y 
Af 

where T contains from to 3 sequences, and is defined by: 

XXX 

pi p2 p3 

Y eTiffaeO Y GTiffbGO y eTiffceO 

aO-W /^o-m \0-{<^} 

Proof. 

(7, X, r, r, Af) 

= (take^, X, Gi, Ei, take2, put-^, teste, Fi, G2, E2, testa, put2, testf,, F2, H2, Hi, Y, Af) 
=> (£^1, take2, put^, teste. Fx, G2, E2, testa, pw^, testb, F2, H2, G[, *X, H[, Y, Af) 

6. a 

^{^0,^3} 

^0 = {take2, puti, teste, Fl, G2, E2, testa, puh^ testt, F2, H2, G[, *'X, H[, Y, Af) 
=^ (E2, testa, put2, testb, Fa, G2, *Fl, teste, put^, H'2, G[, *X, H[, Y, Af) 

6. a ^ ' 

{81,82} 

7. a 

51 = {testa, put2, testb, F^, G'2, *F^, teste, put^, H'2, G[, *X, H[, Y, Af) 
a a 4 O then Si ± 

^ 8.C 



51 ^ {put,, testb, Fl G;, *Fl, teste, puti, H'^, G'l, *X, H[, Y, A^J^g^) 
^ {putx, teste, Fl, G'l testb, Fl H'l G{, *X, H[, Y, A^j/;/) 

^ {X, G'l, teste, Fl G'l testb, Fl H'l H'l Y, A^j/;/) 

= Y, A?j/4>> 

52 = {testb, Put2, testa, Fl G'2, *FI teste, puh, H'2, G'^, *X, H'^, Y, Af) 
lib 40 then S2 ^ ± 

8.C 

if 6 e O then 

52 ^ {put,, testa, Fl Gl *Fl teste, puti, G'l, *X, H'^, Y, Afj^f^) 
^ {putx, teste, Fl G'l testa, Fl Hi G[, *X, H[, Y, Afj^^^^) 
^ {X, G'l teste, Fl G'l testa, Fl H'l H'l Y, A^'f/) 

= <x,^^y,A?J/4>> 
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Ss = {teste, puti, take2, F^, G2, E2, testa, put2, testb, F2, H2, €[, *X, H[, Y, Af) 
\ic4 then ^ 1. 

8.C 

if c e O then 

53 ^ {put,, take2, Fl, G2,E2, testa, put2, testb, F2, H2, G'l, *X, H[, Y, AfJ^[f) 
^ {X, G'l, take2, Fl, G2, E2, testa, put2, testb, F2, H2, H'l, Y, A°-|^^j>) 
= {X, T\ Y, A°,-§>) 



Property 11. Let Y and Z be any sequences, and O, I be two disjoint subsets of |1 ; m] . We have 



□ 



Proof. 



a. Ifb,c£0, then 



Y 
A 
Z 
Af 



Y 
Z 

aO-{6,c} 
^^/U{6,c} 



If a,c € O, then 



r2 

Y 
A 
Z 
A? 



If a,b € O, then 



r3 

Y 
A 
Z 
Af 




a. {T\ Y, A, Z, Af) 

= {G'l, teste, Fl, G'2', testb, F^, H'^, H'l, Y, D,, D2, Z, Af) 
^ {teste, Fl, G'l testb, F^, H'l *I^+|2^ y, D,, D2, Z, Af) 

^ {Fl, G'i, testb, Fl H'l ^I^+f,, Y, D,, D2, Z, Af^^^) 

{-Till,, G'l testb, Fl H'l Y, D,, D2, Z, A^j/;/ > 



7.b 
~4 



6.C 



=4- {G'l testb, FI H'l *TlXll, Y, X^+ig, -D2, Z, Af^^^^) 

l+oot, pi *tP+34 *tP+22 V TP+1 D 7 X^-i'^^X 

/pi ★7-P+34 *-7-p+22 ^ ^+1 7 .0-{b,c}\ 

\-'^2 ' -'-p+45' -'-p+SS' ^' -^2, ^, ■'»-/U{6,c} / 

/★-7-P+46 ★-7-P+34 *-.p+22 



7 \ '-p+eo ' ' -^p+33 ) ) -^1 

/V rp+'^ 7-P+20 7 .o-{6,c}\ 



'P+19 



73a 



^' ^Vu{6,c} / 



S+19' -''p+62' 

/V 7 a'^^{'''^}\ 

V ' -^p+62> ^' ^Vu{6,c} / 
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b. (r^, y, A, z, Af) 

= {G'l, test,, Fl G'l testa, Fl H^, H'^, Y, Di, D2, Z, Af) 
^ {teste, Fl, G'l testa, Fl H'l *J^+ll Y, D,, D^, Z, A°) 



c. (r3, Y, A, Z, Af) 

= {G'l, take2, Fl, G2, E^, testa, put^, testt, F2, H2, H'^' , Y, D^, D2, Z, Af) 
=^ {take^, Fl, G2, E2, testa, put2, testt, F2, H2, *^tll ^' ^i' ^2, Z, Af) 
^ {E2, testa, put2, testb, F2, G'2, *Fl, H'^, ^J-H^, Y, D^, D2, Z, Af) 

{5*4, S'5} 

7. a 

Si = {testa, put^, testb, F^ G'2, *FI, H'^, Y, D^, D2, Z, Af) 

=^ {put^, testb, Fl G;, *Fl, H'^, ^I^+H Y, D^, D2, Z, A^"/;/) 
^ {Fl, G'l testb, Fl Hi -i;Xll Y, D„ D2, Z, A^j/;/) 

(*2^p+ir' G'l testb, Fl H'l ^I-lXll^ -^i' F>2, Z, Af^^°}) 
{G'l testb, Fl Hi Y, ^l,, D2, Z, A?"/;/) 

=^ {testb, FI *2^+45, *2^+33 5 Y, 2^+19! F>2, Z, AJ^J^^^) 





. /Ell *7-p+22 -tr 'j-p+l n 7 \0-{a,b}\ 



_^ /★t-P+46 *-7-p+34 *^p+22 7-p+l r-) 7 

^ \ -^p+60' -^p+45' -^p+33' ^' -^p+19' -^2, ^ 1 




0-{a,b}\ 
■/U{a,b} / 
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8.b 



S5 = (testb, put,, testa, Fl G',, *Ff , H'^, *2^+3'|, Y, -D2, Z, A? > 
{put^, testa, Fl G;, H'^, *mil, Y, D,, D2, Z, A^J"/;/) 
{F^, G'i, testa, Fl Hi ^I^+H Y, D^, D,, Z, A^j/;/) 
jf (*^p+i7' ^2. *esia, Fl Hi *2^+33, Y, D^, D,, Z, A^^^^^j^) 

^ //"<// j.„„i p2 TT// ★'T-p+22 -7-P+I ;-) 7 »0-{a}\ 

\"'2 ' '•'^•''■a' ^2 1 -"2 ' -'-p+SS ' ' -^p+19' ^^/u{a} / 

. /f^,a/ p2 *7-p+34 *-7-p+22 n-p+1 y . 0-{a}\ 

i^lfibla, ^2 I -'•p+45' -'-p+aS' ^' -^p+19' ^2, ^, ■'*-/u{o} / 

' -^p+45' -^p+33' ^' -^p+19' -f-'2> ^7 ^^/u{a,b} / 

/★-7-P+46 ^^+34 7-p+l j-j 7 .0-{a,h}\ 

\ -^p+33' -^p+19' -^2j ^, ■"-7u{a,6} / 

^ /V rP+'^ -7-P+20 7 A'-'~'t"'''}\ 
{i, -i-p+19. -^p+62> •'Vu{a,6} / 

\I, -i-p+62> ^> •'Vu{a,6} / 



7.C 



□ 

3.3 Reduction 

Let (j)he a boolean formula over I variables in conjunctive normal form, such that each clause contains exactly 
three literals. We write k the number of clauses, m = 2>k the total number of literals, and {Ai, . . . , Am,} the 
set of literals. Let n = 31Z + 62fc + 12m (thus, n = 31Z + 98fc). 

Definition 8. We define the sequence as the permutation of |1 ; n] obtained by: 

{keyi, . . . , key^, testi, . . . , testm, A) = Literals{31l + 62k, m) 
For all i e [1 ; I] 

Pi = {j e [1 ; ml I A,- = Xi} 

N, = {j € [1 ; ml I Xj = -x,} 

y„ A) = Vanable{Pi, Ni, 31(i - 1)) 
For all i e [1 ; fcl 

{ai, hi, Cj) = indices such that the i-th clause of (p is Aa^ V A^^ V Ag. 

(7j,ri,Aj) = Clause{ai,bi,Ci,31l + 62{i - 1)) 

= {ui, ... ,1/1,^1, ... ,^k,Vi, ... ,Vi,Ti, ... ,Tk, Di, Di, Ai, Ak, A'^^) 

Two things should be noted in this definition. First, elements fce?/j and testi are used in the clause and 
variable gadgets, although they are not explicitly stated in the parameters (cf. Definitions 6 and 7). Second, 
one could assume that literals are sorted in the formula (</> — (Ai V A2 V A3) A . . . ), so that = 3i — 2, 
&i = 3i — 1 and Ci — 3i, but it is not necessary since these values are not used in the following. 

We now aim at proving Theorem 18 (p. 25), which states that is efficiently sortablc if and only if the 
formula (j) is satisfiable. Several preliminary lemmas are necessary, and the overall process is illustrated in 
Figure 10. 

3.3.1 Variable assignment 

Definition 9. Let r G JO; ^1. An r-assignment is a partition V = {T,F) of Jl ; r\. An I -assignment is 
called a full assignment. Using notations from Definition 8, we define the sequence S^\P] by: 
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{ui,...,iyi,-fi,...,jk,Vi,...,Vi,Ti,...,Tk,Di,...,Di,Ai,...,Ak,Al) 




Open locks in Pi 
Vi ^ V( = 



Open locks in A^i 
Vi ^ V( = 




U2 




Open locks in Pi 

Vi ^ v{ = 



Open locks in A''; 

VI « v; = V' 




Test lock ai 

Ti ^ r'l = ri 







Test lock hi 
Ti ^ r'l = Tl 







in 



Test lock ci 

Fi r'l = rf 

:7 




A 



Open remaining 
locks in Pi U A^i 
Di^X 



Open remaining 
locks in Pi U Ni 



Test remaining 
locks in {ai, 61, Ci} 



Test remaining 
locks in {afe,6fe,Cfe} 
Afe J 



I 

Figure 10: Description of an efficient sorting of S'^. Circular nodes correspond to head elements or sequences 
especially relevant (landmarks). We start with the head element of S^: vi. Prom each landmark, one, two 
or three paths are possible before reaching the next landmark, each path having its own effects, stated in 
rectangles, on the sequence. Possible effects are: transforming a subsequence of (symbol opening a 
lock, testing a lock (such a path requires the lock to be open). The top- left quarter, from vi to ui, is studied 
in Section 3.3.1; the bottom-left quarter, from 71 to 7^, is studied in Section 3.3.2; and the right half, from 
VI to r^, is studied in Section 3.3.3. Indices are removed from identity sequences {I) for readability. 
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o = y Pi u y 

iGT iG-F 

S^\P\ = (I'r+i, . . . , I'/, 71, . . . , 7fc, F/, . . . , y/, • • ■ , Vi, 

ri,...,rfc,Di,...,A,Ai,...,Afc,A^) 

Property 12. iei r G [0 ; with r < I, V = {T,F) be any r-assignment, Vi = (T U {r + 1},F) and 
V2 = {T,FU{r + l}). Then 

S^[V]^{S^[V,],S4,[V2]} 
Proof. This is a direct application of Property 9. a on variable (i/^+i, Fr+ij -Dr+i), using sequences: 

X = {vr+2, ...,n,li,---nk,Vi,..., vl) 

Y = {Vr+2,...,VuTu...,Tk,Du...,Du/^u---,^k) 

□ 

With the following lemma, we ensure that any sequence of efficient flips from begins with a full 
assignment of the boolean variables, and every possible assignment can be reached using only efficient ffips. 

Lemma 13. 

{S^fP] I V full assignment^ 

Proof. We prove Sff, => {S^\P] \ V r — assignment} by induction for all re |[0 ; Z] , and the lemma is deduced 
from the case r = I. 

There is only one 0-assignment, which is Vq = (0,0), and = S^[Po]- Consider now any r <l. We use 
notations Vi and V2 from Property 12. Then any (r + l)-assignment can be written Vi or V2, where V is 
some r-assignment. We have 

=> {Sci,['P] I V r-assignment} by induction hypothesis 
S4, =^ {S^[Pi], 5*0 [7^2] I V r-assignment} by Property 12 
= {SAV] \V' {r + l)-assignment} 

□ 



3.3.2 Going through clauses 

Now that each variable is assigned a boolean value, we need to verify with each clause that this assignment 

satisfies the formula (f>. This is done by selecting, for each clause, a literal which is true, and testing the 
corresponding lock. As in Definition 8, for any i S |1 ; A;] we write (aj, bi, Ci) the indices such that the i-th 
clause of ^ is A^. V A{,. V Xc^ (thus, ai,bi,Ci G |[1 ; m]). 

Definition 10. Let t & fO; kj and V be a full assignment. A f-selection a is a subset of |1 ; m] such that 

• \a\=t 

• for eachi G |1 ; tj, \{ai,bi,Ci} D a\ = 1 
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A t-selection a and a full assignment V — (T, F) are compatible, if, for every i G a, literal Aj is true 
according to assignment V (that is, Aj = Xj and j G T, or Aj = -^Xj and j G F). 

A k-selection is called a full selection. Given a t-selection a and a full assignment V = (T, F) which are 
compatible, we define the sequence S^[P,a] by: 



For all i e |1 ; /] , V- = 



ifi e T 

ifi&F 
(rj ifai e a 

For all i e [1 ; il , T'^ ^ if hi e a 

[rf ifaea 

0=\JPiU\jNi-a 

ieT ieF 

I = a 

50[^,^] = (7t+i,---,7fc,W,---,'^i',r;,...,r;,rt+i,...,rfc,£>i,...,A,Ai,...,Afc,AO) 

Property 14. Let V be a full assignment and t £ |0 ; kj, t < k. Let a' be a {t + 1) -selection compatible with 
V, then there exists a t-selection a compatible with V such that a d a' . 

Proof. It is obtained by cr = cr' — {at+i,bt+i,Ct+i}. It is trivially a t-selection included in a, and it is 
compatible with V (all selected literals in a are also selected in a', and thus are true according to 7-*). □ 

Property 15. Let t G ; kj, t < k, V be a full assignment, and a be a t-selection compatible with V. 

S^[P, a] => {S^[P, a'] \ a' (t + l)-selection compatible with V;(t C a'} 

Note that the right-hand side can be the empty set, in which case S^[P,a] 0. 

Proof. First note that there are 3 + l)-selections such that a C a', and they are a[ = a [J {a^+i}, 
(72 = o" U {fej+i}, and Cg = U {ct+i}. Since a is compatible with V, (j[ is compatible with V iff literal Xat+i 
is true in V (and similarly with couples (o'2,A(,j^J and (cg, Acj^J). We now define sequences X and Y and 
sets I and O such that S^lVja] = (it+i, X, Tt+i, Y, Af), that is: 

X = {jt+2,---,7k,Vl, . . .,v{,r[, ...,T[) 

y = (rt+2,...,rfe,A,-..,A,Ai,...,Afe,> 

0=[jP,u[jN,~a 

ieT ieF 

I = a 

Using Property 10 on clause gadget (7^+1, F^+i, A(+i), we obtain: 
where T is defined by: 

{x,rU„Y,A^j^[lX]^)eT iff a,+ieO 
<x, r,\i, r, Afj/,^;+//) e T iff e o 
(x,r?+i,F,A°-/^-+^/)eT iff c,+ieO 

Note that flf+i ^ (7, hence at+i € O iff Eli G T s.t. aj+i G Pi or 3i G F s.t. at+i G Ni. Equivalently, 
at+i G O iff Xat+i is a positive occurrence of a variable assigned True in P, or a negative occurrence of a 
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variable assigned False in V. Finally, aj+i G O iff crj is compatible with V. Likewise, 6t+i & O \S a'^ is 
compatible with V, and Ct+i e O iff as is compatible with V. 

S^[V, a[] = {X, rj+i, Y, A-fj[\^'^'y^) e T iff (j[ is compatible with V 
S4'P,cr'2] = {X,Tf^^,Y, A'^J^f'^'+^'^) €i: iff is compatible with P 
[P, = {X, r3+i , Y, Afj-/;;+^/ ) e T iff 4 is compatible with V 

Thus T is indeed the set of sequences S,f,[P,a'] where a' is a + l)-selection which contains a and is 

compatible with V: the property is proved. □ 

With the following lemma, we ensure that after the truth assignment, every efficient path starting from 
needs to select a literal in each clause, under the constraint that the selection is compatible with the 
assignment. 

Lemma 16. Let V be a full assignment. Then 

S^[P] {S^[P,a] I a full selection compatible with V} 

Proof. The proof follows the same pattern as the one of Lemma 13, that is, we prove 

S4,[P] {S^lVja] I a f-selection compatible with V} 

by induction for all t G |0 ; kj, and the lemma is deduced from the case t = k. 

There is only one 0-selection, which is ctq = 0, it is compatible with V, and S^[P] = S^lVjCro]- Consider 
now any t < k. We have 

Sfi,[P] => {S^[P,a] I (T t-selection compatible with V} (by induction hypothesis) 

S^[P] => {S^[P,(7'] \ a' {t + l)-selection compatible with V and 

3a i-selection compatible with V, <t C (t'} by Property 15 

= {S^[P,a'] \ a' {t + l)-selection compatible with V} by Property 14 

□ 

3.3.3 Beyond clauses 

Lemma 17. Let P be a full assignment and a be a full selection, such that V and a are compatible (provided 
such a pair exists for (j)). Then 

Proof. Write V = (T, F). Since cr is a full selection, S^\P, a] can be written (see Definition 10): 
For alH e [1 ; Zl , ¥[ 

For all i e |1 ; fc] , F^ 

0=|JPiU|JiVi-a 

I = a 

^0[^,^] = (n',---,v'/,r'i,...,F;,A,...,A,Ai,...,Afc,A?> 




24 



We extend the definition of set O to O^, for any r e |[0 ; /], as follows: 

Or= U (Pi U iV,) U y P, U y TVi - (7 

Note that Oq = O, and that O; = [l ; m] — a. 

<7] = (V/, . . . , V;', r'l, . . . , r'fe, Di, . . . , A, Ai, . . . , Afc, A?") 



I t 

( V3', . . . , y/, r;, . . . , r;,i3\,i||, • • • , A, Ai, . . . , a^, a?^) 



g=^^ (F^', . . . , VI, ri, . . . , r;,i3\, £>2 . . . , A, Ai, . . . , Afe, A?^) 



9.b/c 

/p/ w 7-1 7-32 ^31i-30 A A, A°'\ 

g l^y^ \J- 1> •••>-'- fc5-'-31'-'-62' ••• '-^-311 ,'^l)---i^fe,-'i/ / 



= (r;,...,n,T3\„Ai,...,A,,A?') 

Finally, for the last part, we use a similar procedure, with the following sets, for is |0 ; A:]: 



= 11; mj- aU {a^hud} 

\ 0<i<t 



0<i<t 



Note that O'q = Oi, Iq = I, O'^ = 0, = Jl ; m], and more importantly, for i > t, assuming that € ct 
(cases hi & a and Ci & a are similar), then Ui G I[, bi £ 0[ and Ci S Oj. Hence we can successively apply 
Property 11 (either .a, .b or .c) on each clause gadgets. 

<r;,...,r'fe,i3\;,Ai,...,Afc,A°°) 

^ /w p/ T-1 T-31/ + 1 A A, a'-'i\ 

X-"- 2' ■ ■ • ' ^ fc'-'-31i'^31i+62' ■^2; • • • , ^fe, / 

, /p/ p' 7-1 t31' + 1 t31'+63 a A a'-'sX 

=^ 3) • ■ • ) fe'-^31ii-^31(+62'-^31J+124' ■^Sj • • • , ^/c, / 

^31i+l ^31t+63 ^31i+62fe-61 a \ 

\-^3i;'-^31J+62'-^31J + 124' * * * ' -'-31J+62fc ^ I'k ' 

_ (ji j3i;+i A \ 

— \-^31b-^3i;+62fc'^^|l;ml/ 

_ /7-I 7-31; + ! 7-3i;+62fe+l \ 

— \-'-31^-'-31i+62fc'-'-31i+62fe+12m/ 

= 



□ 



Theorem 18. 



50 iif (f) is satisfiable. 



Proof. Assume first that I^. By Lemma 13, since {5*0 [P] | P full assignment}, there exists 

a full assignment P = {T,F) such that the path from 5^ to the identity uses S^[V]. Note that S^[P] 

Now, by Lemma 16, since S^\P] => {S^\P,(j] \ a full selection compatible with P}, there exists a full 
selection ct, compatible with P, such that the path from ^'^[P] to the identity uses S^\P,a\. Consider the 
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truth assignment Xi := True <^ i ^ T . Then each clause of (j) contains at least one literal that is true (the 
literal whose index is in a), and thus is satisfiable. 

Assume now that (j) is satisfiable: consider any truth assignment making (j) true, write T the set of indices 
such that Xi = True, and F = \1; l\ — T. Write also a a set containing, for each clause of (/i, the index of 
one literal being true under this assignment. Then a is a full selection, compatible with the full assignment 
V = (T, F). By Lemma 13, there exists an efficient path from to S^[P]. By Lemma 16, there exists an 
efficient path from S^[P] to S^\P, cf\. And by Lemma 17, there exists an efficient path from S^\P,a\ to the 
identity. Thus sequence is efficiently sortable. □ 

Using Theorem 18, we can now prove the main result of the paper. 

Theorem 19. The following problems are NP-hard: 

• Sorting By Prefix Reversals (MIN-SBPR) 

• deciding, given a sequence S, whether S can he sorted in db{S) flips 

Proof. By reduction from 3-SAT. Given any formula (f), create (see Definition 8, the construction requires 
a linear time). By Theorem 18, the minimum number of flips necessary to sort is db{S^) iff (j) is satisfiable. 

□ 

4 Conclusion 

In this paper, we have shown that the Pancake Flipping problem is NP-hard, thus answering a long-standing 
open question. We have also provided a stronger result, namely, deciding whether a permutation can be 
sorted with no more than one flip per breakpoint is also NP-hard. 

Among related important problems, the last one having an open complexity is now the burnt variant 
of the Pancake Flipping problem. An interesting insight into this problem is given in a recent work from 
Labarre and Cibulka [13], where the authors characterize a subclass of permutations that can be sorted in 
polynomial time, using the breakpoint graph [1]. Another development consists in trying to improve the 
approximation ratio of 2 for the Pancake Flipping problem, both in its burnt and unburnt versions. 
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